Comparing Human and Automated Evaluation of Open-Ended Student Responses to Questions of Evolution
نویسندگان
چکیده
Written responses can provide a wealth of data in understanding student reasoning on a topic. Yet they are timeand laborintensive to score, requiring many instructors to forego them except as limited parts of summative assessments at the end of a unit or course. Recent developments in Machine Learning (ML) have produced computational methods of scoring written responses for the presence or absence of specific concepts. Here, we compare the scores from one particular ML program – EvoGrader – to human scoring of responses to structurallyand content-similar questions that are distinct from the ones the program was trained on. We find that there is substantial inter-rater reliability between the human and ML scoring. However, sufficient systematic differences remain between the human and ML scoring that we advise only using the ML scoring for formative, rather than summative, assessment of student reasoning.
منابع مشابه
Assessing creative problem-solving with automated text grading
The work aims to improve the assessment of creative problem-solving in science education by employing language technologies and computational–statistical machine learning methods to grade students’ natural language responses automatically. To evaluate constructs like creative problem-solving with validity, open-ended questions that elicit students’ constructed responses are beneficial. But the ...
متن کاملLaboratories Performance after Outsourcing in the Hospitals of Shahid Beheshti University of Medical Sciences
Abstract Background and Objective: Nowadays, downsizing the government to have an effective and flexible organization is considered to be government’s top priority in the world and outsourcing is one of the ways to achieve this goal. Accordingly, Shahid Beheshti University of Medical Sciences has delegated some of its hospitals' duties to the private sectors. The present study has been carried...
متن کاملIranian New Junior High School Book (Prospect 1) Weighted against Material Evaluation Checklist from Teachers' Perspective
The aim of this study was to evaluate the new version of Iranian EFL junior high school textbook (Prospect1) from the teachers’ perspectives. The participants included90experienced English teachers (42 females and 48 males) randomly selected from different junior high schools in different districts of Gilan province, Iran. The evaluation of the textbook was conducted quantitatively through a 5-...
متن کاملLaboratories Performance after Outsourcing in the Hospitals of Shahid Beheshti University of Medical Sciences
Abstract Background and Objective: Nowadays, downsizing the government to have an effective and flexible organization is considered to be government’s top priority in the world and outsourcing is one of the ways to achieve this goal. Accordingly, Shahid Beheshti University of Medical Sciences has delegated some of its hospitals' duties to the private sectors. The present study has been carried...
متن کاملReflection perspectives of Tabriz Nursing Student
Introduction: The phenomenon of knowledge explosion has led teachers to feel the necessity of training students so that they become reflective thinkers. This issue is more important for nursing students who are responsible for providing care for patients.This study is a part of another study arming at exploration of Nursing Students’ views on reflection on practice. Methods. 20 senior nursing...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1603.07029 شماره
صفحات -
تاریخ انتشار 2016